Greedy Selection of Species for Ancestral State Reconstruction on Phylogenies: Elimination Is Better than Insertion
نویسندگان
چکیده
Accurate reconstruction of ancestral character states on a phylogeny is crucial in many genomics studies. We study how to select species to achieve the best reconstruction of ancestral character states on a phylogeny. We first show that the marginal maximum likelihood has the monotonicity property that more taxa give better reconstruction, but the Fitch method does not have it even on an ultrametric phylogeny. We further validate a greedy approach for species selection using simulation. The validation tests indicate that backward greedy selection outperforms forward greedy selection. In addition, by applying our selection strategy, we obtain a set of the ten most informative species for the reconstruction of the genomic sequence of the so-called boreoeutherian ancestor of placental mammals. This study has broad relevance in comparative genomics and paleogenomics since limited research resources do not allow researchers to sequence the large number of descendant species required to reconstruct an ancestral sequence.
منابع مشابه
Rapid maximum likelihood ancestral state reconstruction of continuous characters: A rerooting‐free algorithm
Ancestral state reconstruction is a method used to study the evolutionary trajectories of quantitative characters on phylogenies. Although efficient methods for univariate ancestral state reconstruction under a Brownian motion model have been described for at least 25 years, to date no generalization has been described to allow more complex evolutionary models, such as multivariate trait evolut...
متن کاملReconstructing Ancestral Genomic Orders Using Binary Encoding and Probabilistic Models
Changes of gene ordering under rearrangements have been extensively used as a signal to reconstruct phylogenies and ancestral genomes. Inferring the gene order of an extinct species has the potential in revealing a more detailed evolutionary history of species descended from it. Current tools used in ancestral reconstruction may fall into parsimonious and probabilistic methods according to the ...
متن کاملCritical threshold for ancestral reconstruction by maximum parsimony on general phylogenies
We consider the problem of inferring an ancestral state from observations at the leaves of a tree, assuming the state evolves along the tree according to a two-state symmetric Markov process. We establish a general branching rate condition under which maximum parsimony, a common reconstruction method requiring only the knowledge of the tree, succeeds better than random guessing uniformly in the...
متن کاملSelecting Genomes for Reconstruction of Ancestral Genomes
It is often impossible to sequence all descendent genomes to reconstruct an ancestral genome. In addition, more genomes do not necessarily give a higher accuracy for the reconstruction of ancestral character states. These facts lead to studying the genome selection for reconstruction problem. In this work, two greedy algorithms for this problem are proposed and tested on computer simulation dat...
متن کاملMaximum likelihood reconstruction of ancestral amino-acid sequences
Maximum-likelihood methods are used extensively in phylogenetic studies [3]. In particular, aminoacid sequences of ancestral species have been inferred using these methods [7]. Such ancestral reconstruction tasks aim at identifying either the most likely sequence in a specific ancestor species (marginal reconstruction), or the most likely set of ancestral states corresponding to all the ancestr...
متن کامل